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Abstract 

Let < a < 1/2. We show that that the mixing time of a continuous-time Markov 
chain on a finite state space is about as large as the largest expected hitting time of 
a subset of the state space with stationary measure > a. Suitably modified results 
hold in discrete time and/or without the reversibility assumption. The key technical 
tool in the proof is the construction of random set A such that the hitting time of 
A is a light-tailed stationary time for the chain. We note that essentially the same 
results were obtained independently by Peres and Sousi. 

1 Introduction 

The present paper is a contribution to the general quantitative theory of finite-state 
Markov chains that was started in [2j and further developed in [4j. The gist of those 
papers is that the so-called mixing time of a Markov chain is fundamentally related, 
in a precise quantitative sense, to hitting times and other quantities of interest. Our 
main achievement is to add a new equivalent quantity to this list by showing that 
mixing times nearly coincide with maximum hitting times of large sets in the state 
space. 

Remark 1 (Important remark) The results in this paper were proven (but not 
made public) around May 2010. In July 2011 we learned that extremely similar results 
for discrete-time chains have been proven independently by Peres and Sousi We 
then decided to submit our results, in the hope that our ideas might also be found 
useful and interesting. We will discuss their results at several points in our paper. 
Here we just mention that the main difference between the papers is the construction 
of the stopping time in Lemma[l\ (see Section \1.1\) . 

*IMPA, Rio de Janeiro, RJ, Brazil, 22430-040. Work supported by a Bolsa de Produtividade em Pesquisa 
and by a Pronex grant from CNPq, Brazil. 



We need to introduce some notions before we clarify what we mean; j3] and [5] 
are our main references for the involved concepts. In this paper E will always denote 
the finite state space of a continuous-time Markov chain with generator Q, with 
transition rates q{x,y) {x,y €z E, x ^ y). Most of the time Q and E will be implicit 
in our notation. The trajectories of the chain are denoted by {Xf}tyQ, and the law of 
{Xt}t>o started from x G E ot from a probability distribution over E are denoted 
by Wx or (respectively) . For t ^ 0, we write: 

Pt{x,y)=Wx{Xt = y) {x,yeE) 

for the transition probability from x to y at time t. In what follows we will always 
assume that Q is irreducible, which implies that it has a unique stationary distribution 
TT and: 

y{x,y)eE^ : lim pt{x,y) = TT{y). 

We can measure the rate of this convergence after we introduce a metric over proba- 
bility distributions. We choose the total variation metric: 

c^Tv(/^> ^) = ™ax \n{A) — h'{A)\ = - S~] \fi{a) — i^{a)\ (/i, v prob. measures over E) 
A<zE Z — ^ 

aG_E 

and define the mixing time of Q as: 

T^iJ5) = inf{t > : Vx € ii;, (iTv(pt(x, •), vr(-)) < 5}- 

Finally, given 7^ A C i?, we may define the hitting time of A as: 

Ha = inf{t >0 : XteA}. 

Results for reversible chains. Recall that Q is reversible if 7r(x)g(x, y) = ■K{y)q{y,x) for 
all distinct x,y & E. In this setting, Aldous proved: 

Theorem 1 (Aldous, L2j) There exist universal (ie. chain independent) constants 
c_,C-|_ > such that for any irreducible, reversible, finite- state- space Markov chain 
in continuous time with generator Q: 

C-Tg,<TL(l/4)<c+T« 
where T^;,^ = sup{7r(^) [Ha] ■ x e E, ^ A C E}. 

Notice that T^^^ = 1 if Q consists of iid jumps at rate 1 between states in E, so T^j^ 
can be viewed as a measure of how "non-iid" the chain is. Informally, the mixing time 
is another measure of "non-iid-ness" , and the Theorem shows that these two measures 
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are quantitatively related in a very strong sense. We emphasize that Theorem [T] is 
part of a much larger family of universal inequalities for reversible Markov chains; 
see [2] for details. 

In this paper we prove a stronger form of Theorem [TJ Given a > 0, let: 



Unlike Tj^j^, only "large enough" sets are considered in this definition. We prove in 
Section m that: 

Theorem 2 For any < a < 1/2 there exist constants C+{a),C-{a) > depending 
only on a such that, for any irreducible continuous-time Markov chain as above: 



Although similar to Theorem[Tl the intuitive content of Theorem [2] seems different: 
instead of measures of non-iid-ness, we have a statement that says that mixing times 
are about as large as the expected time necessary to hit any large set, which is quite 
reasonable. Theorem [2] should also be easier to use in applications. The condition 
a < 1/2 is discussed in Section [LTl 

Remark 2 Theorem\M also holds in discrete time if pi{x,x) > 1/2 for all x ^ E 
(use |5l Theorem 20.3]). Peres and Sousi have shown that pi{x,x) > /3 > for 
any fixed /3 > also suffices. Some lower bound onpi{x,x) is necessary; otherwise 
there are counterexamples such as large complete bipartite graphs with an edge added 
to one of the parts. 

Results for non-reversible chains. Theorem [2] and the main results of [2] only apply 
to reversible chains; counterexamples can be found in that paper. Aldous, Lovasz 
and Winkler [4] developed a quantitative theory in the general case using a different 
notion of mixing time. Let Mi([0, t]) be the set of all probability measures over [0,t] 
and define: 



In discrete time, one replaces Mi([0,t]) with the set Mi({0, . . . ,t}) of all probability 
measures over {0,...,t}. Aldous, Lovasz and Winkler [1] proved an analogue to 
Theorem [1] for arbitrary Markov chains in discrete time, where Trmix replaces Tmix 
(their method can also be applied in continuous time). We prove an analogue of 
Theorem [2] in this setting: 



T'^^.ia) = sup{E, [Ha] : x e E, ^ A C E, 7r{A) > a}. 
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Theorem 3 For any a G (0,1/2) there exist C'__{a) > 0,C'^{a) such that for any 
irreducible finite-state Markov chain Q in continuous time: 

CUa)Tg,(a) < T?^,Jl/4) < C;(a)Tg,(a). 

Remark 3 Our proof can be easily adapted to discrete time. Peres and Sousi 
have proved a variant of Theorem \^ where T^-_jj^(l/4) is replaced by another notion 
of time-averaged mixing, with ji a geometric distribution with success probability 1/t. 

1.1 Discussion of the results 

Outside of potential applications to bounding mixing, Theorems [2] and [3] seem concep- 
tually interesting. They show that mixing times are natural in that they are strongly 
related to hitting times, a quantity of intrinsic interest. For instance, we have the 
following immediate corollary of Theorem [31 

Corollary 1 There exists some universal C > such that for any irreducible Markov 
chain in discrete or continuous time, 

^ ^ ^ ~ tt{A) 

We omit the proof, which follows from T^j^ < cTrmix(l/4) < c'T^i^^(l/3) (with 
c, c' > universal) . This result says that one may control the hitting times of small 
sets via those of large sets.sOther applications of (slight variants of) our theorems 
are considered in [9]. 

The limitation a < 1/2 is not clearly necessary for the Theorems to hold. However, 
Peres [8] noted that one cannot allow a > 1/2. In that case one may contradict 
the two theorems by connecting two complete graphs by a single edge. In this 
case Thit(a) = O (n) whenever a > 1/2, since any set A with tt{A) > occupies a 
cosntant proportion of the mass of each clique. However, mixing requires crossing the 
connecting edge, so Tmix(l/4) = Q (Ti.inix(l/4)) = 0, (n^). The intersting question is 
then: 

Question 1 What happens when a = 1/2? 

In 2009 Peres conjectured that T^j^.(l/2) is also "equivalent up to universal con- 
stant factors" to T^j^(l/4) (for lazy and reversible Q) and Tji^j^(l/4) (in general) 
[1]. We prove this result in an upcoming paper with Griffiths, Kang and Patel. 
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1.2 Steps of the proof 

The main step in the proof is Lemma [U proven in Section [2j We construct there a 
randomized stopping time T, which depends on the initial distribution, such that Xt 
has the stationary distribution. This stopping rule is the hitting time of a randomly 
chosen subset A C E, where the possible values of A form a chain D ^2 ^ • • • ^ 
An- We will see that this property property implies that we can control the tail of 
Ha via T^j^(a). We note that this stopping time was outlined in [7, Theorems. 4] and 
[6l Theorem 4.9], but it is not explicit anywhere. Moreover, results in [7J imply that 
T is minimal in some sense (cf. Remark [5]). Peres and Sousi ^ prove similar results 
via another minimal stopping rule, the so-called filling rule that was also employed 
in [2113]). We believe that our construction provides an interesting alternative point 
of view. 

Ater the construction of T, our paper continues with the proofs of Theorem [3l 
proven in Section [3l The elegant argument we use argument employs Lemma [U 
together with a simple coupling devised in the survey [6] . The proof of Theorem [2] in 
Section U] follows a convoluted computation in [2] , which we reproduce in order to get 
the sharp form we need. An Appendix presents a simple lower bound of T^j^(a/2) 
in terms of T^j^(a). 

1.3 Acknowledgements 

We thank Yuval Peres for the counterexample in Section 11.11 [8] and both him and 
Perla Sousi for presenting [9] to us. 

2 A special stationary stopping time 

We use the notation in Section [TJ Recall that a randomized stopping time for this 
chain is a [0, +oo)-valued random variable T such that for alH > the event {T < t} 
is measurable relative to the a- field generated by {Xs}s<t and an independent random 
variable U. 

Lemma 1 Suppose hq is a probability measure over E. Then there exists a random- 
ized stopping time T with 

{Xt = ■) = vr(-) and P^^ {T>t)<e + for all e G (0, 1). (1) 

Remark 4 The same result works (with a slightly different proof) if ti is replaced by 
another target distribution /Ui over E and vr substitutes fii in the definition o/T^j^(e). 
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Remark 5 Although we do not use this, one can show that E^j^ [T] is minimal among 
all randomized stopping times with (Xt = •) = 7r(-). This is because our T has a 
halting state Theorem 4-5]- 

Remark 6 We note from the definitions that T^j^(e) < T^^^/e. We may plug this 
into Lemma {1\ and optimize over e to deduce: 



Aldous proves a similar bound for a different stopping time, which he uses to 
prove TheoremUl The same proof would go through with our own T. Another proof 
of Theorem [I] is presented in 

Proof: Let n = \E\ denote the cardinality of E. The idea in the proof is to find a 
chain of subsets E = Ai D A2 D • • • D An 7^ and numbers pi, . . . ,pn > with 
X^jPi = 1- We then define a random A that equals Ai with probability pi and define 
T = Ha- We will then show that if {Xt}t>o is a realization P^g that is independent 
from A, then Law(X7^) = tt. The tail behavior of T = Ha will follow automatically 
from the construction. 

Notation. For any set 7^ S" C let ps{-) = P^jq {^Hs = ') denote the harmonic 
measure on S for the chain started from fiQ. The irreducibility of the chain implies 
that Hs < +00 P^p-a.s. and therefore ps is a probability measure over E with sup- 
port in S. 

Inductive construction of (Aj, Pi): Set = and choose a 1 G j4i so that (oi )/'?'" (^i) 
is the maximum of /9Ai(a)/7r(a) over all a G Ai. Since the vr- weighted average of such 
ratios satisfies: 



the maximal value must satisfy (ai)/7r(ai) > 1. We then choose pi = n^ai)/ pAiiai) 
and note that pi G [0,1], pipAi{ai) = '?r(ai) and pipAiia)/7r{a) < 1 for all other 
a G E\{ai}. 

Assume inductively that we have chosen distinct elements ai , . . . , G E and 
numbers < pi, . . . ,pk < 1 such that if Ai = E\{aj : 1 < j < i} {1 < i < k), we 
have the following properties: 





1. for ah 1 <j <k, Yli=iPiPA^{aj) = 7r(aj); 

2. moreover, for a G E\{ai, a^}, Yli=iPiPAiia) < 7r(a). 
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Assume also that k < n, so that Afc+i = E\{ai, . . . , ak} is non-empty. We will prove 
that one may choose {pk+i,CLk+i) so as to preserve these properties for one further 
step. The following claim is the key: 

Claim 1 The set Vk+i C [0, 1] x ^^+1 of all {p, a) with YH=iPiPAi{a) +PPAfc+i(a) = 
7r(a) is non-empty. 

Given the claim, we choose a pair (pfc+i, a^+i) G Vk+i with minimum value of 
the first coordinate. Let us show that condition 2. above remains valid for a G 
E\{ai, . . . , Ofc+i}. Any a violating 2 would have to satisfy: 

k k 

^PiPAiia) < 7r(a) < Pk+ipA^+M) + ^PiPAi{a), 
i=l i=l 

and this would imply that there is some < p < Pk+i with: 

k 

PPAk+i{a) + '^PiPAiia) = 7r(a) (ie. {p,a) G Vk+i), 
1=1 

which would contradict the minimality of p^+i. 

To prove that condition 1. also remains valid, we simply observe that it certainly 
holds for ajt_|_i and that it also holds for ai, i < k + 1, because ^ ^fc+i and therefore 
pAk_^.i{ai) = . Hence such a choice of Pfe+ij^^fe+i preserves the induction hypothesis 
for one more step. 

We now prove the Claim. Notice that: 

Since the first term in the LHS is an average, there must exist some a G A^+i with 
PAk+iia) > ■""(a), whence: 

k 

^PiPAiia) + PAk+^a) > 7r(a). 

i=l 

Moreover, the inductive assumption 2. implies that Yli=iPiPAi{0') ^ ''"(a), so there 
exists some p G [0, 1] with 

k 

^PiPaM) +PPAk+^{a) = 7r(a), 
1=1 
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which proves the claim. 



Analysis of the construction. Carrying the induction to its end at A; = n imphes that 
there exist pi, . . . ,pn € [0, 1] and an ordering ai, . . . ,an of the elements of E such 
that, if Ai = E\{aj : 1 < j < i}, then: 

n i 

VI < z < n,7r(aj) = ^^pjPA^iai) = '^pjPAj{ai) 
j=i j=i 

(the last identity in the RHS follows from Aj for j > i). 

These are the only facts about the construction we will use in the remainder of 
the analysis. We now prove some consequences of these facts. First notice that: 

n n n n n n 

'^Pj = ^Pj ^PA,{ai) = ^^PjPAj{ai) =^TT{ai) = 1, 

j=l j=l 1=1 i=l j=l i=l 

which implies that the pi form a probability distribution over {1, . . . ,n}. Moreover, 
the same line of reasoning implies that for all /c E {1, . . . , n}: 

k k / k \ k k k 

^Pj >^Pj lYl P"^^ ) = ^^PiPAj (°^») = = ^ ~ 7r(Afc+i), (2) 

j=i j=i \j=i / j=i j=i 1=1 

where An+i = by definition. 

We now define our randomized stopping time as T = Ha, where the choice of 
A is independent of the realization of the chain and ¥ {A = Ai) = Pi, 1 < i < n. 
Notice that A ^ hence T < +oo almost surely. Moreover, it is easy to check that 
(Xt = ■) = 9'S desired. 

To finish, we bound the upper tail of T. Given e G (0, 1), let j(e) be the largest 
J € [n + 1] with iT{Aj) > e (recall our convention An+i = 0). Since the Ai^s form a 
decreasing chain, ^ implies: 

Pmo (vr(^) >e) = ^FiA = A,) = ^.Pi > 1 " <Aj^e)+i) > 1 " 

i=l 1=1 

Moreover, j < j{e) imples Aj D ^j(e)- We deduce: 

P^o iT>t) < F^, {7r{A) < e) + F^, {HA>t\ tt{A) > e) 
< e + F,,(HA^^^,>t] 



< e+ 



Ha, 



t ~ t 

□ 
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3 Mixing of non-reversible chains 



In this section we prove Theorem [3j 

Proof: [of Theorem [3] The lower bound on T^j^(a) fohows easily from the ideas in 
0]. We give a proof in the Appendix for completeness. For the upper bound, we 
proceed as follows. Define: 

dr{t) = inf sup dxv ( / Ps(.x, ■) fj,{ds), ps{z, ■) fi{ds) ] . 
/ieMi([0,t])^,^6£; \Jo Jo J 



Claim 2 For all t > 0, 

dr{kt) <dr{tf. 

Proof: [of the Claim] A standard compactness argument shows that there exists a 
measure /i which achieves the infimum in the definition of dy{t). Let M be the 
discrete time Markov chain whose transition probabilities are given by: 

m{x,y)= / psix,y) ii{ds), {x,y)eE^. (3) 
Jo 

Define: 

dM{k) = sup dTy{mt{x,-),int{y,-)) 

where mt is the transition probability for t steps of m. Notice that dpii^) = dr{t) 
by the choice of /i. Moreover, dr{kt) < dM^k) because k steps of M correspond to 
replacing ^ in ([3]) by its /c-fold convolution with itself /i**. Lemma 4.12 in [S] implies 
that 

dr{kt) < duik) < dM{lf = dr{tf. 

□ 

Notice that dr{t) < 1/4 implies T^j^(l/4) < t. We will spend most of the rest of 
the proof proving that for all irreducible Markov chains Q, 

Goal: dr (c{a) Tg^(a)) < 1 - 6{a), (4) 

where c{a),6{a) > depend only on a € (0,1/2). Applying the Claim with t = 
c{a) T^j^(a) and k = k{a) such that (1 — 6{a))^ < 1/4 we may then deduce that 

T^j^(a) < C+(a)T^j^(a) where C_|_(a) = k{a) c{a) depends only on a, 
which is the desired result. 
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Given x,z ^ E, we let {Xt}t>o and {Zt}t>o denote trajectories of Q started from 
X and z (respectively). Let Tx, Tz be obtained from Lemma [T] for /io = 5x and 6z 
(resp.). Clearly, 

Law(XT^) = Law{ZT^) = ir. 

Sample U uniformly from [0, t] and independently from the two chains. The Markov 
property and the stationarity of vr imply: 

Now fix some t > and define 

Ux = {Tx + U) mod t and Uz = {Tz + U) mod t. 

Notice that Ux is uniform over [0,t], independently from {Xt}t>o, and similarly for 
Uz- Hence: 

Law{XuJ = / •) and Law(ZwJ = / Ps{z, ■) iJ.{ds), 

Jo Jo 

where is uniform over [0,t]. Therefore, 

dTV i Ps{x,-)Kds), ps{z,-) fj.{ds)] = dicy{Law{XuJ,Law{Zu,)) 



Jo 



< dTy{Law{XuJ,Law{XT^+u)) (5) 
+(iTv(Law(ZwJ, Law(ZT,+w)) 

by the triangle inequality and the previous remarks. We now show that: 



dTY{Law{XuJ,La^{XT,+u)) < « + (6) 

This is of course trivial if t < T^^^{a), so we assume the opposite is true. The coupling 
characterization of total variation distance implies that for any A € (0, 1): 

dTy{Law{XujM^{XT^+u)) < Fx{Xu^^Xt^+u) 

< Fx{U>t-Tx) 

< Fx{Tx > Xt) + ¥{{1 - X)t <U <t) 

(use Lemma HI = a + ^'^ + A 

At 
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Choosing A = y TJ^^ (a) / 1 gives ([6]) . We plug this and the corresponding statement 
for Zt^j^u into ^ to deduce: 



Tgt(«) 



^TV Psix,-) Kds), J Psiz,-) Kds)^ <2a + 4]^ ^ 
Now recall that a < 1/2 and take 

For this value of t, we have: 

^TV Ps{x,-) Kds), J Ps{z,-) Kds)^ < 

Since X, z are arbitrary, we deduce dH) with c(a) =64/(1— 2a;)^ and (5(a) = (1 — 2a)/2. 
□ 



4 Mixing of reversible chains 

We now prove Theorem [2l 

Proof: [of Theorem [2] Notice that T^j^(a) > T^-^(a), so the lower bound in the 
Appendix also applies here. For the upper bound, we first define: 

d{t) = sup dTYipt{x,-),pt{z,-)). 

x,z<^E 

It is well-known that d is submultiplicative [21 Chapter 2] and that d{t) < 1/4 implies 
T^j^(l/4) < t. In light of this, we need to show that: 

Goal: d (c(a) Tg^(a)) < 1 - 6{a), (7) 

where c(a), 5{a) > depend only on a G (0, 1/2). 

Basic definitions for the proof. Let U > L > (we will choose their values later). Fix a 
pair x,z G E and let {Xt}t>o and {Zt}t>o denote trajectories of Q started from x and 
z (respectively). Also let Tx,Tz be the randomized stopping times given by Lemma [1] 
for the X and Z processes, and define rix,r]z to be the probability distributions of 
{Xt^ , T^) and (Zt, , T^) over Ex [0, +oo). Finally, we let /^(a) = (Xt, = a,T^ < L) 
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and f,{a) = {Zt, = a,T, < L) (aeE). 
Estimating total variation distance. Recall: 

dTYiPt{x,-),Pt{z,-)) = \Pt{x,a) -ptiz,a)\ 



Notice that: 

pt{x, a) = {Xt = a, T.^ < L) + {Xt = a,T^ > L) , 
and similarly for ptiz, a). Therefore, 

dTY{pt{x,-),Pt{z,-)) < ^^lPx-(Xt = a,r, <L)-P,(Zt = a,r, <L)1 

ag_E 

+ i i^t = a,T,>L)- P, {Zt = a,T,>L)\ 



< 



E 



(P, (Xt = a,T,<L)- P, (Zt = a,T,< L))' 



Ti{a) 



+\ Yl = a,T,>L)- P, {Zt = a,T,> L) \. (8) 



where the last line uses the Cauchy Schwartz inequality. We may further bound: 
5^|P,(Xt = a,T, >L)-P,(Zt = a,r, >L)| < ^ P, (X* = a, > L) 

a£E 

+ Y,^z{Zt = a,T, > L) 



a€E 



a&E 



< P,(T, >L)+P,(r, >L), 



and plugging this into ([8]) gives the inequality: 



d'TY{Pt{x,-),pt{z,-)) < - 



\ 



(P. (Xt = a,T,<L)-F,IZ,=a,n<L)y 



E 

aeE 

, (T, > L) + P, (T, > L) 



vr a 



(9) 



Averaging. Our next step is to average the LHS and RHS of Q over t € [L, U]. Since 
dTy{pt{x, ■),pt{z, •)) is decreasing in t [5j, the distance at time t = [/ is at most this 
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average. We use concavity to move the averaging inside the square root and deduce: 



1 

dTv{pu{x,-),Pu{z,-)) - dTv{pt{x,-),Pt{z,-))dt 



< 



U-L 



aeE 



{Xt = a,T^<L)- {Zt = a,T,< L)f 
7r(a) 

(T, >L)+P, (T, >L) 



+ 



• (10) 



The term inside the square root. Define El = -Ex [0,L]. By the strong Markov 
property: 




Pt-s{u, a)pt-s'{u', a) dr]x{u, s)dr]x{u, s'). 



By reversibiUty, we may rewrite the integrand in the RHS as 

Pt-s {u, a)Tr{a)pt-s' {a,u')/'K{u'), 

which imphes that: 

¥xiXt = a,Tx<Lf 



E 

aeE 



7r(a) 



I El J El 



'!r{u') 



P2t-.'-s'{u,u') 



dr]x{u,s)dr]x{u',s'). 



I El J El 7r(«') 

Integrating over t (with the change of variables t' = 2t — s — s'), we find that: 



1 



U-L 



E 



Fx{Xt = a,Tx<Ly 



dt 



vr a 



El J El X'^U - 2L JaL-s-s' 7r(M') 



2U-S-S' 



Pt'{u,u') , 



■J— dt drjx{u, s)drix{u , s ) 



< 




El J El \2U - 2L Jq 



2U 



P^l^!^dA dr,x{u,s)dr,x{u',s') (11) 
7r{u') J 
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where the last inequahty follows from the fact that [2L — s — s' ,2U — s — s'] C [0, 2U], 
which holds for all s, s' in the range considered. With this the bracketed term becomes 
independent of s, which may be integrated out. Since: 



/ drixiu, s) = fx{u) < 7r(n), 

J{u}x[0,L] 



we obtain: 



1 f'^sr^ ^x {Xt = a,T^<Lf 



2U 



r-2L 



as well as a similar bound for z. On the other hand, starting from the formula: 
{Xt = a,T^< L) {Zt = a,T,< L) 

= / Pt-s{u,z)pt-s'{u',z)drjx{u,s)dr]z{u',s') 

averaging over t G [L, U] and using [2L — s — s', 2L + 2C/ — s — s'] D [2L, 2?7], we may 
obtain: 

1 r V- i^t = a,T^ < L)F, {Zt = a,T,< L) 

''^ aeE ^ ' 

Combining these bounds we obtain 



z6£ 

r2U N 2L 



< E (/.w-/.w)( ^''"](;,f''''' )(^/^ p.(»,»')ciw) + 



U-L 
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To bound the sum in the RHS, we notice again that fx{-),fz{-) < '?'"(•)) ^.Iso 
that for aU u E, '^^i Pwiu,u') = 1- Hence 



nG-B 

Now recall that 

Uu) = P, (Xt^ = u, T, < L) = 7r{u) - (Xt, = u,T, > L) 
and similarly for z, so that 

5] - /.(n)| = ^ |P, {Xt^ =a,T^>L)- P, (Zt, = a,r, > L) |. 

uG-E aG-E 

We deduce that the term inside the square root in pO|) is bounded by: 

1 ^ (P. {Xt = a,T,<L)- P, {Zt = a,T,< L)f 

U-lJl ^ tt{z) 

2L 

< {Xt, =a,T^>L)- P, (Zt, =a,T,>L)\ + 



< ^P^ {Xt, = a,n >L) + Y,^z {Zt^ = a,T, > L) + 



U -L 

aeE aeE 

2L 

< P, (T, > L) + P, (T, > L) 



U-L 

Wrapping up. We now plug this previous inequality into ([TO]) to deduce: 
dTY{pu{x,-),Pu{z,-)) 



1 / 21 
< -Wp,.(T,. >L)+P, (T, >L) + 



2V 'U-L 

P^(T,. > L)+P,(r, > L) 
2 

If the quantity inside the square root is < 1, we get another upper bound: 



2L 

dTv{pu{x, ■),Pu{z, •)) < \l^x (T, >L)+¥, (T, > L) + ^ (13) 
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Now by Lemma [T] 



(T, > L) + {T, > L) < 2a + 2 



L 



so choosing 



L 



1 - 2a 



and U 



I -2a V 1 - 2a 



we obtain: 



(T,. > L) + (T, > L) + < < 1. 



U - L - 2 

Thus the condition for ([13]) is satisfied, and we have the bound: 



dTyiPu{x,-),Pu{z,-)) < 



l + 2a 



Since x,z E are arbitrary, we deduce: 

2' 



d 



+ 



8 



l-2a VI -2a 
which has the form requested in ([7]). □ 



T^itH <i- 1 



l + 2a 



Appendix: the lower bound 



In this section we prove the lower bound part of the main theorems. As above, 
Q is a irreducible continuous-time Markov chain with state space E and stationary 
distribution vr. The trajectories of the chain are denoted by {Xt}t>o 

Proposition 1 For any a € (0, 1), T^jj.(a) < c(a)T^j^ where c(a) > depends only 
on a. 

Proof: It follows from Claim [2] that: 

TiiJl/2'=)<fcT^,,Jl/4). 

In particular, 

Tf^,M < (log2(l/«) + 1) T?„,i,(l/4). 
Thus it suffices to show that T^^^ia) < {2/a)Tf^-^{a/2). 



16 



Fix A <zV with measure vr(^) > a and x (^V. By the definition of T^-j^(a/2) and 
a simple compactness argument, there exists a distribution supported on [0, T^-^(a/2)] 
such that if U has this distribution and is independent from {Xt}t, 

dTv(Law(Xw),7r) < 1 - a/2. 

As a result, 

Fx {Xu^A)<l- tt{A) + dTY(Law{Xu),n) < 1 - 2 ■ 

Since Z^/ is supported in [0, T^j^(a/2)], 

{HA>Tf^-Ja/2)}c{Xu^A}, 

and we deduce: 

yxeV,yAcV with tt{A) > q : Px (i^A > T^mix(«/2)) < 1 " (14) 



Let us use this to show that E^. [H^] < (2/q) Tji^j^(a/2) for all x and A as above. 
Let /c G N\{0} and denote by A^, the law of X^^,_^^^q ^^^2) conditioned on {ff^ > 

(^-l)T^Lix(«/2)}- By (m. 



'A.(^A>T,^^Ja/2)) <1-" 



whereas by the Markov property, 

Ha > kTf^^M^)) < Fx (Ha >{k- l)Tf^,M/2)) Pa, (Ha > T«^i.(a/2; 



< {l-^)^4HA>ik-l)T^^-Ja/2) 

( a'^^ 
(...induction...) < '2 

We deduce: 

Ex[ffA] ^y^/^ o\fe_2 

^rmixV"/^^ fc>0 

Since x &V and A CV with vr(y4) > a were arbitrary, this finishes the proof. □ 
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